|
|
|
A special XML file for RTP streaming must initiate Audio push.
<?xml version="1.0"?>
<Response>
<Audio
packetsize=”10|20|30|40|50|60”
codec = “PCMU | PCMA”
>
<AudioTimer value="30"/>
<Url href=”RTPRx://remote_ip_address:remote_port”/>
<Promptline>
This text goes on the Prompt Line
</Promptline>
</Audio>
</Response>
Each <Response> can contain only one <Audio> tag with the following attributes:
Each <Audio> tag can contain <AudioTimer> and <Url> tags. The <AudioTimer> tag is used as an inter-packet timer. This timer is set every time a packet is received. After an administrable duration where no packets are received the RTP stream is terminated. This tag has the following attributes:
| Attribute |
Value |
Description |
| value |
X (seconds) |
Default is 20 seconds. The range is 5 to 30 seconds. |
Each <Url> tag consists of information for the RTP streaming server and the local receive port of the telephone. The <Url> tag has the following attributes:
| Attribute |
Value |
Description |
| href |
string |
RTP Streaming URI Format. |
The following reserved URIs can be used in the href attribute to control an audio stream:
NOTE: For example, if an audio stream originator wanted to explicitly stop an audio stream the following would be sent in new Push Content:
Audio quality depends on the streaming source providing the audio at an appropriate pace. The pace depends on the packet size. If you are using 40ms packets, then the packets should be separated by 40ms.
Transmitting the packets too slowly results in odd silences when the telephone has no audio to play out its Speaker. Transmitting the packets too quickly results in broken sound, as the telephone is forced to drop packets it cannot maintain in its buffer.
The error in the pace measured at the telephone is called jitter. If the jitter stays below the size of one packet, then jitter does not impact the audio quality. Note that a +2ms jitter on one packet cancels out a -2ms jitter on the next. However, 40ms packets, each separated by 38ms, means that the jitter grows +2ms with each packet. After 3 seconds, the jitter would be +150ms, and the telephone would need to drop audio to maintain its buffers.
|
|
|